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Abstract 

A Keith number is a positive integer N with the decimal rep- 
resentation a±a2 ■ ■ ■ a n such that n > 2 and TV" appears in the se- 
quence (K m ) m >i given by the recurrence K\ = a±, . . . , K n = a n and 
K m = K m ^i + K m _2 + • • • + K m - n for m > n. We prove that there 
are only finitely many Keith numbers using only one decimal digit 
(i.e., a% = ci2 = • • • = a n ), and that the set of Keith numbers is of 
asymptotic density zero. 

1 Introduction 

With the number 197, let (K m ) m >i be the sequence whose first three terms 
K\ = 1, K2 = 9 and K3 = 7 are the digits of 197 and which satisfies the 
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recurrence K m = K m _i + K m _ 2 + K m _ 3 for all m > 3. Its initial terms are 

1, 9, 7, 17, 33, 57, 107, 197, 361, 665,... 

Note that 197 itself is a member of this sequence. This phenomenon was 
first noticed by Mike Keith and such numbers are now called Keith numbers. 
More precisely, a number N with decimal representation a\a 2 . . . a n is a Keith 
number if n > 2 and N appears in the sequence K = (K^) m >i whose 
n initial terms are the digits of iV read from left to right and satisfying 
K% = K^_i + K^_ 2 + • • • + K^_ n for all m> n. These numbers appear in 
Keith's papers j3j and [3] and they are the subject of entry A007629 in Neil 
Sloane's Encyclopedia of Integer Sequences [TT] (see also [7j, [H] and 0). 

Let K be the set of all Keith numbers. It is not known if K is infinite or 
not. The sequence /C begins 

14, 19, 28, 47, 61, 75, 197, 742, 1104, 1537, 2208,2580, 3684, 4788,... 

In total there are 94 Keith numbers smaller than 10 29 ([4]). Recall that a rep- 
digit is a positive integer iV of the form a(10 n — l)/9 for some a € {1, . . . , 9} 
and n > 1; i.e., a number which is a string of the same digit a when written 
in base 10. Our first result shows that there are only finitely many Keith 
numbers which are rep-digits. 

Theorem 1. There are only finitely many Keith numbers which are rep-digits 
and their set can be effectively determined. 

We point out that some authors refer to the Keith numbers as replicating 
Fibonacci digits in analogy with the Fibonacci sequence (F n ) n >i given by 
F\ = 1, F 2 = 1 and F n+2 = F n+ i + F n for all n > 1. In [5] it is shown that 
the largest rep-digit Fibonacci number is 55. 

The proof of Theorem uses Baker's type estimates for linear forms 
in logarithms. It will be clear from the proof that it applies to all base b 
Keith numbers for any fixed integer b > 3, where these numbers are defined 
analogously starting with their base b expansion (see the remark after the 
proof of Theorem [TJ . 

For a positive integer x we write K{x) — K D [1, x\. As we mentioned 
before, /C(10 29 ) = 94. A heuristic argument in jl] suggests that #/C(a;) ^> 
logx, and, in particular, that /C should be infinite. Going in the opposite 
way, we show that /C is of asymptotic density zero. 
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Theorem 2. The estimate 

#/c(x) « -JL= 

Vlogx 

holds for all positive integers x > 2. 

The above estimate is very weak. It does not even imply that that sum of 
the reciprocals of the members of K, is convergent. We leave to the reader the 
task of finding a better upper bound on #K{x). Typographical changes (see 
the remark after the proof of Theorem |2J) show that Theorem |2] also is valid 
for the set of base 6 Keith numbers if 6 > 4. Perhaps it can be extended also 
to the case 6 = 3. For 6 = 2, Kenneth Fan has an unpublished manuscript 
showing how to construct all Keith numbers (see [4 ) and that, in particular, 
there are infinitely many of them. For example, any power of 2 is a binary 
Keith number. 

Throughout this paper, we use the Vinogradov symbols 3> and <C as well 
as the Landau symbols O and o with their usual meaning. Recall that for 
functions A and B the inequalities A < B, B > ,4 and A = 0(B) are all 
equivalent to the fact that there exists a positive constant c such that the 
inequality \A\ < cB holds. The constants in the inequalities implied by these 
symbols may occasionally depend on other parameters. For a real number x 
we use logo; for the natural logarithm of x. For a set A, we use j^A and |^4| 
to denote its cardinality. 
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2 Preliminary Results 

For an integer N > 0, recall the definition of the sequence K N = (K^) m >i 
given in the Introduction. In K N we allow iV to be any string of the 
digits 0,1,..., 9, so iV may have initial zeros. So, for example, K 020 = 
(0, 2, 0, 2, 4, 6, 12, 22, . . . ). For n > 1 we define the sequence L n as L n = K M 
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where M = 11... 1 with n digits 1. In particular, L 1 = (1,1,1,...) and 
L 2 = (1,1,2,3,5,8,...), the Fibonacci numbers. In the following lemma, 
which will be used in the proofs of both Theorems 1 and 2, we establish 
some properties of the sequences K N and L n . 

Lemma 1. Let N be a string of the digits 0, 1, . . . , 9 with length n > 1. // 
N does not start with 0, we understand it also as the decimal representation 
of a positive integer. 

1. If N has at least k > 1 nonzero entries, then > L^ +m _ n holds for 
every m > n + 1. 

2. If N has at least one nonzero entry, then > L^_ n holds for every 
m > n + 1. We have < 9L^ for every m > 1. 

3. Ifn>3 and N = for some m > 1 (so N is a Keith number), then 
2n < m < 7n. 

4- For fixed n > 2 and growing m > n + 1, 

L n m = 2 m - n -\n - 1)(1 + 0(m/2 n )) + 1 

where the constant in O is absolute. 

Proof. 1. By the recurrences defining K N and L h , the inequality clearly holds 
for the first k indices m — n + 1, n + 2, . . . , n + k. For m > n + k it holds by 
induction. 

2. We have K% > 1 = L n m _ n for m = n + 1, n + 2, . . . , 2n and the 
inequality holds. For m > 2n it holds by induction. The second inequality 
follows easily by induction. 

3. The lower bound m > 2n follows from the fact that K N is nondecreas- 
ing and that 

K 2n < 9L 2n = 9 ' T ^ ( n ~ X ) + 9 < < N 

for n > 3. To obtain the upper bound, note that for m > n we have by 
induction that > L^_ n+2 > m ~ n where = 1.61803. . . is the golden 
ratio. Thus, by part 2, 

10" > N = K» > L n m __ n > r~ 2n 
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and m < (2 + log 10/ \og(p)n < In. 

4. We write L 7 ^ in the form L 7 ^ = (2 m ~ n ~ 1 — d(m))(n — 1) + 1 and prove 
by induction on m that for m > n + 1, 

< d{m) < m2 m - 2n . 

This will prove the claim. 

It is easy to see by the recurrence that L™ +1 , L™ +2 , . . . , L 2n+1 are equal, 
respectively, to 2°(n - 1) + 1, 2 1 (n - 1) + 1, . . . , 2 n (n - 1) + 1. So d(m) = 
for n + l<m<2n + l and the claim holds. For m > 2n + 1, 

rra _ rn , r n _i_ , r n 

n 

= ^ (^2 m - n ' 1 - k - d(m - k))(n - 1) + lj 
fc=i 

n 

= ^m-n-l _ 2 m-2n-l + j _ ^ d{m - jfe)) (n - 1) + 1 

fc=l 

and the induction hypothesis give 

n 

< dim) = 2 m ~ 2n - 1 -1 + J2 d ( m ~ k ) 



fc=i 

n 

m—2n—k 

k=l 



< m2 m - 2n . 



□ 



In part 4, if m is roughly of size 2 n and larger then the error term swallows 
the main term and the asymptotics is useless. Indeed, the correct asymptotics 
of when m — - > oo is ca m where c > is a constant and a < 2 is the only 
positive root of the polynomial x n —x n ~ 1 — - ■ ■— x— 1. But for m small relative 
to 2 n , say m = 0(n) (ensured for Keith numbers by part 3), this "incorrect" 
asymptotics of L 7 ^ is very precise and useful, as we shall demonstrate in the 
proofs of Theorems Q and 121 

In the proof of Theorem ^ we will apply also a lower bound for a linear 
form in logarithms. The following result can be deduced from Corollary 2.3 

Of0. 
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Lemma 2. Let Ai,...,Ak, A4 > 1, and ni, ...,7ifc be integers, and let 
N = max{|ni|, . . . , |nfc|, 2}. There exist positive absolute constants c\ and 
C2 (which are effective), such that if 

K = ni log Ai + n 2 log A 2 H h n k log A k ^ 0, 

then 

log |A| > -cit^(logAi) . . . (logA fc ) log iV. 

For the proof of Theorem 2 we will need an upper bound on sizes of 
antichains (sets of mutually incomparable elements) in the poset (partially 
ordered set) 

P(k,n) = ({l,2,...,k} n ,< p ) 
where < p is the product ordering 

a = (ai, a 2 , ■ ■ ■ , a n ) < p b = (61, b 2 , ■ ■ ■ , b n ) <^=^ a» < h for z = 1, 2, . . . , n. 

We have n)| = k n and for k = 2 the poset P(2, n) is the Boolean poset 
of subsets of an n-element set ordered by inclusion. The classical theorem 
of Sperner (see pQ or [2|) asserts that the maximum size of an antichain in 
P(2,n) equals to the middle binomial coefficient (ij^i)- In the next lemma 
we obtain an upper bound for any k > 2. 

Lemma 3. If k > 2,n > 1 and X C P(k,n) is an antichain to < p , then 

Proof. We proceed by induction on k. For k = 2 this bound holds by 
Sperner's theorem because 

n \ 2 n 

< 



[n/2\J n 1 / 2 

for every n > 1. Let k > 3 and X C n) be an antichain. For A running 
through the subsets of [n] = {1,2, . . . ,n}, we partition X in the sets Xa 
where Xa consists of the u G X satisfying m = k <^=^> i G A. If we delete 
from all u G Xa all appearances of k, we obtain (after appropriate relabelling 
of coordinates) a set of \Xa\ distinct (n — |A|)-tuples from P(k — 1, n — \A\) 
that must be an antichain to < p . Thus, by induction, for \A\ < n we have 

((fc-l)/2)-(fc-l)"-|A| 
|Xa| < (n-|A|)V2 
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and \X\ n ]\ < 1. Summing over all As and using the inequality ^Jn/m < 
(n + l)/(m + 1) (which holds for 1 < m < n) and standard properties of 
binomial coefficients, we get 

1*1 = 

Ac[n] 

n-l 



< 



^ (n\ ({k - l)/2) ■ (fc - 1)" 



i=0 

i H— 1 / \ 

I / n \ 

n—i+1 



< 



< 



n/(n — i) ■ (k — 1) 



n+1 



□ 

We conclude this section with three remarks as to the last lemma. 

1. Various generalizations and strengthenings of Sperner's theorem were 
intensively studied, see, e.g., the book of Engel and Gronau 0. Therefore, 
we do not expect much originality in our bound. 

2. It is clear that for k = 2 the exponent 1/2 of n in the bound of 
Lemma El cannot be increased. The same is true for any k > 3. We briefly 
sketch a construction of a large antichain when k = 3; for k > 3 similar 
constructions can be given. For k = 3 and n = 3m > 3 consider the set 
X C P(3, n) consisting of all u which have i Is, n — 2i 2s and i 3s, where 
i — 1, 2, . . . , m = n/3. It follows that X is an antichain and that 

n \ s-^ n\ 



i=l 



x\=yi )=y 



i,i,n — 2i) ^—f (i\) 2 (n — 2i)\ 

y i=l v 



By the usual estimates of factorials, if m — \fn < i < m then 



n \ ( n \ 3 r 

» > 



i,i,n — 2i) \m,m,mj n 
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Hence X is an antichain in P(3, n) with size 

i — 3 n 3 n 
X > Vn- — = 

n y/n 

3. For composite k we can decrease the factor k/2 in the bound of 
Lemma El Suppose that k = Im where I > m > 2 are integers and let 
X C P{k, n) be an antichain. We associate with every u G X the pair of 
n-tuples {v u , id") G P(m, n) x P(/, n) defined by = Ui — m\ui/m\ +m and 

= \ui/m\, 1 < i < n. Note that the pair (v u ,w u ) uniquely determines 
u and that if w u = w u then v u and v u are incomparable by < p . Thus, by 
Lemma El for fixed w G P(l,n) there are less than (m/2)m n / ' ^fn elements 
u £ X with w u = w. The number of ws is at most \P(l, n)\ = l n . Hence 

< (m/2) ■ m" p = (m/2)-^ 

In particular, if is a power of 2 then \X\ < k n / ^fn for every antichain 
XcP(Jt,n). 



3 The proof of Theorem U 

Let N = a(10 n - l)/9 = aa ... a, 1 < a < 9, be a rep-digit. Since if^ = aL n , 
iV is a Keith number if and only if the repunit M = (10 n — l)/9 = 11 . . . 1 is 
a Keith number. Suppose that M is a Keith number: for some m we have 

M = = C = ^^(n -l)(l + 0g)), 

where the asymptotics was proved in Lemma 04. We rewrite this relation as 

9(n - 1) ~ ~~ 9(n - l)2 m - n ~ 1 + V2V ' 
Since 2n < m < 7n by Lemma d3, we get 

r\2n+l-mcn , „ * 

— L_i = f-V 

9(n-l) \2 n / 
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Because 5 n > 9(n — 1) for every n > 1, the left side is always non-zero. 
Writing it in the form e A — 1 and using that e A — 1 = 0(A) (as A — > 0), we 
get 

Tl 

^ A = (2n + 1 -m) log 2 + nlog5 - log(9(n - 1)) < — . 

2 n 

Taking logarithms and applying Lemma El we finally obtain 

— d(\ogn) 2 < log |A| < c(logn — nlog2) 

where c, d > are effectively computable constants. This implies that n is 
effectively bounded and completes the proof of Theorem ^ □ 

Remark. The same argument shows that for every integer b > 3 there are 
only effectively finitely many base b rep-digits, i.e., positive integers of the 
form a(b n — l)/(b— 1) with a G {1, . . . , 6—1}, which are base b Keith numbers. 
Indeed, we argue as for b = 10 and derive the equation 

b n 

- 1 = 0(n/2 n ). 

(6- l)(ra- l)2 m -™- 1 v ' ' 

In order to apply Lemma El we need to justify that the left side is not zero. 
If b is not a power of 2, it has an odd prime divisor p, and p n cannot be 
cancelled, for big enough n, by (b — l)(n — 1). If b > 3 is a power of 2, then 
b — 1 is odd and has an odd prime divisor, which cannot be cancelled by the 
rest of the expression. 



4 The proof of Theorem [21 

For an integer > 0, we denote by n the number of its digits: 10 n_1 < N < 
10 n . We shall prove that there are <C 10 n /\/n Keith numbers with n digits; 
it is easy to see that this implies Theorem 2. There are only few numbers 
with n digits and > n/2 zero digits: their number is bounded by 




< n2 n 9 n/2 = n6 n < (10 n ) 



Hence it suffices to count only the Keith numbers with n digits, of which at 
least half are nonzero. 

Let N be a Keith number with n > 3 digits, at least half of them nonzero. 
So, A^ = for some index m > 1. By Lemma ^3, 2n < m < 7n and we 



9 



may use the asymptotics in Lemma QJ4. Setting k = \n/2\ and using the 
inequality in Lemma we get 

10r>N = K%> L k k+m _ n . 



Lemma ^4 gives that for big 



n 



2 m - n - 1 (k-l] 



L" — I — 511 Tl ^ ■ . ^ 



J k+m—n - 2 12 



On the other hand, the second inequality in Lemma^2 and Lemma^4 give, 
for big n, 

lO™" 1 < N = <9L^<9- 2 m ~ n n. 



Combining the previous inequalities, we get 

10 n 

< 2 m ~ n n < 12 • 10 n 

90 

This implies that, for n > no, the index m attains at most 12 distinct values 
and 

m = (l + logl0/log2 + o(l))n = (« + o(l))n. 

Now we partition the set S of considered Keith numbers (with n digits, 
at least half of them nonzero) in blocks of numbers iV having the same 
value of the index m and the same string of the first (most significant) k = 
[n/2\ digits. So, we have at most 12 ■ 10 fe blocks. We show in a moment 
that the numbers in one block B, when regarded as (n — fe)-tuples from 
P(10, n — k), form an antichain to < p . Assuming this, Lemma El implies that 
\B\< 10 n ~ k+1 /2\/n^ r k. Summing over all blocks, we get 

inn-fe+l inn 

|g|<12-10 fc - iU . «^=, 

2^n~^k \fn 



which proves Theorem 2. 

To show that B is an antichain, we suppose for the contradiction that N\ 
and N% are two Keith numbers from B with N\ < p N 2 . Let M = N 2 — N\ and 
M* = 00 . . . 0M G P(10, n) (we complete M to a string of length n by adding 
initial zeros). It follows that M has at most n — k digits and M < 10 n_fc . 
On the other hand, by the linearity of recurrence and by Ni < p N 2 , we have 



M = N 2 - Nt = - K* 1 = K, 



M* 
m "m 
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Since M* has some nonzero entry, the first inequality in Lemma ^2 and 
Lemma n 4 give, for big n, 



K M* > rn 9 m-2n-2 

Thus 

10 n " fc = lO n ~LW2j > m > 2 m - 2n - 2 n. 
Using the above asymptotics of m in terms of n, we arrive at the inequality 

exp((±loglO + o(l))n) > exp((« log 2 - 2 log 2 + o(l))n) 

= exp((log 5 + o(l))n) 

that is contradictory for big n because 10 1 / 2 < 5 = 10/2. This finishes the 
proof of Theorem 2. □ 

Remark. The above proof generalizes, with small modifications, to all bases 
6 > 4. We replace base 10 by 6, modify the proof accordingly, and have to 
satisfy two conditions. First, in the beginning of the proof we delete from the 
numbers with n base b digits those with > an zero digits, for some constant 
< a < 1. In order that we delete negligibly many, compared to b n , numbers, 
we must have 2 • (b — 1) 1_Q < b. Second, for the final contradiction we need 
that b a < 6/2. For b > 5, both conditions are satisfied with a = 1/2, as 
in case b = 10. For b = 4 they are satisfied with a = 0.49, say. However, 
for 6 = 3 they cannot be satisfied by any a. Thus, the case 6 = 3 seems to 
require more substantial changes. 
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